Audio-visual television broadcast programs processing, transcription, indexing and searching
Abstract
This paper describes the development of a system for automatic processing, transcription, and indexing of television broadcast news. The system's main task is automatic transcription of television broadcast programs from the audio signal. The transcribed recordings are indexed and stored in a database, and a web-based system, built as the second task, searches that database. Searches can be made by keywords (or sentences) or by speaker. The time boundaries of individual words and audio segments are also stored during indexing, so retrieved results can easily be compared with the original recording. The system also processes the visual information in television recordings: modules for visual signal segmentation, face detection and identification, and visual speech detection have been added to the transcription system. The recognized visual information is indexed and stored in the database together with the information from the acoustic signal, and it is included in the web search system.
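As a rough illustration of the indexing idea in the abstract, the sketch below stores each recognized word with its time boundaries and speaker label, and supports lookup by keyword or by speaker. It is a minimal in-memory sketch, not the authors' implementation: the names `Word` and `TranscriptIndex`, the field layout, and the sample data are all assumptions made for illustration, and the actual system uses a database and a web front end.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """One recognized word (hypothetical record layout)."""
    text: str       # recognized word
    start: float    # start time in seconds
    end: float      # end time in seconds
    speaker: str    # speaker label assigned to the segment
    recording: str  # identifier of the source recording


class TranscriptIndex:
    """In-memory stand-in for the database described in the abstract."""

    def __init__(self):
        self.by_word = defaultdict(list)
        self.by_speaker = defaultdict(list)

    def add(self, word):
        # Index case-insensitively by word text, and by speaker label.
        self.by_word[word.text.lower()].append(word)
        self.by_speaker[word.speaker].append(word)

    def search_word(self, query):
        # Return every occurrence of the keyword, with its time boundaries.
        return list(self.by_word.get(query.lower(), []))

    def search_speaker(self, speaker):
        # Return every word attributed to the given speaker.
        return list(self.by_speaker.get(speaker, []))


# Made-up sample data, for illustration only.
index = TranscriptIndex()
for w in [
    Word("good", 0.00, 0.31, "anchor_1", "news_001"),
    Word("evening", 0.31, 0.78, "anchor_1", "news_001"),
    Word("evening", 12.40, 12.90, "reporter_2", "news_001"),
]:
    index.add(w)

for hit in index.search_word("Evening"):
    print(hit.recording, hit.speaker, hit.start, hit.end)
```

Because each hit carries `start` and `end`, a player could seek straight to the matching position in the original recording, which is the comparison the abstract describes.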
Similar resources
Audio-Visual Speaker Recognition for Video Broadcast News
Significant progress has been made in the transcription of the audio stream in the broadcast news domain for both radio news and TV news (HUB4 task). Such transcripts provide an excellent means of indexing video content for search and retrieval. Speaker identification is an important technology in this domain both for selecting high-accuracy speaker-dependent models for transcription and as an in...
The Effect of Broadcast Digitalization on Agricultural Information Dissemination in Nigeria.
Broadcast digitalization with its enormous benefits to the broadcasting industry will improve the quality of content of programs delivered by television stations. Africa has a switchover date of June, 2017. For Nigerians to have access to television broadcast once the switch over is completed, they must purchase high definition television sets or the set-up box. The awareness among urban dwelle...
Structuring Broadcast Audio for Information Access
One rapidly expanding application area for state-of-the-art speech recognition technology is the automatic processing of broadcast audiovisual data for information access. Since much of the linguistic information is found in the audio channel, speech recognition is a key enabling technology which, when combined with information retrieval techniques, can be used for searching large audiovisual d...
Browsing and Retrieval of Full Broadcast-Quality Video
In this paper we describe a system we have developed for automatic broadcast-quality video indexing that successfully combines results from the fields of speaker verification, acoustic analysis, very large vocabulary speech recognition, content based sampling of video, information retrieval, natural language processing, dialogue systems, and MPEG2 delivery over IP. Our audio classification and ...
Parallel Algorithms for Indexing and Retrieval in Audio Databases
The recent explosion in the use of non-text, multimedia data such as audio, video, images, and graphics necessitates the development and use of multimedia databases. Efficient schemes for indexing and searching in multimedia databases are essential for fast and sophisticated data retrieval. The highly complex nature of audio/visual query processing, and the vast amounts of data in multimedia databas...